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Abstract — We propose a regularized saddle-point algorithm 
for convex networked optimization problems with resource 
allocation constraints. Standard distributed gradient methods 
suffer from slow convergence and require excessive communi- 
cation when applied to problems of this type. Our approach 
offers an alternative way to address these problems, and ensures 
that each iterative update step satisfies the resource allocation 
constraints. We derive step-size conditions under which the 
distributed algorithm converges geometrically to the regularized 
optimal value, and show how these conditions are affected by 
the underlying network topology. We illustrate our method on 
a robotic network application example where a group of mobile 
agents strive to maintain a moving target in the barycenter of 
their positions. 

I. Introduction 

Recent years have witnessed an increased interest in 
the development of distributed optimization algorithms for 
diverse applications involving networks of systems, such as 
multi -robot coordination [1], distributed estimation [2] , and 
resource allocation in wireless systems [3]. These algorithms 
are tailored to accommodate the distributed computation and 
network constraints inherent in peer-to-peer communication 
based applications. 

In this paper we focus on primal-dual (saddle-point) 
iterative methods for solving convex optimization problems 
with resource allocation constraints in addition to convex 
constraints on local variables and sparse (convex) coupling 
constraints. Standard (sub)gradient methods [4], [5] have 
gained increased attention in the past years, yet their con- 
vergence rate is known to be rather low especially when 
the cost function is non-strictly convex. A recently proposed 
method [6], [7] regularizes the initial convex problem and 
thereby increases the convergence rate of common algo- 
rithms delivering a solution arbitrarily close to the one of 
the original problem. Motivated by this strategy, we also 
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make use of regularization and solve the resulting strictly 
convex problem via a saddle-point method. Furthermore, we 
incorporate the resource allocation equality constraints di- 
rectly into the saddle-point iterations by extending the results 
of [8] (originally proposed for unconstrained problems). We 
derive step-size conditions that guarantee convergence of our 
iterative scheme, and show how these results are linked to the 
problem characteristics and the graph topology, respectively. 
In standard dual decomposition approaches [9] one would 
typically dualize the equality constraints and use consensus 
mechanisms to distribute the resulting Lagrangian function 
over the network. This technique results in a slow conver- 
gence rate, especially if the network is sparsely connected. 
Our proposed approach incorporates the resource allocation 
constraints directly in the saddle-point iteration, and uses 
regularization to obtain faster convergence. This leads to 
an inherently distributed method, which converges to the 
solution of the original regularized problem. 

We illustrate our algorithm on a realistic robotic network 
example where a number of mobile robots strive to keep 
a moving target in the barycenter of their positions. This 
scenario is motivated by the recent interest in target tracking 
and target circumnavigation, e.g. [10], [11], where distributed 
algorithms are required to be applicable in real-time. 

The paper is organized as follows. Section II formulates 
the problem. In Section III we propose the regularized 
saddle-point algorithm, whose convergence properties are 
studied in Section IV. Section V describes the application 
scenario that is used to illustrate our method. Finally, in 
Section VI we draw our conclusions and plans for future 
research directions. 

II. Problem Formulation 

We use the standard notation 1„ and /„ to indicate a 
vector of all ones of dimension n and an identity matrix 
of dimensions n x n, respectively. The symbol ® denotes 
the Kronecker product, (-, ■) is the dot product, while || • || 
is the 2-norm. For a real symmetric matrix A, the notation 
A >- indicates that A is positive definite, (A) denotes its 



i-th smallest eigenvalue whereas the largest one is A max (A). 

We study constrained optimization problems on a network 
of computing nodes. The network is modeled as a connected 
graph Q = (V, £), with vertices (nodes) in the set V = 
{1, . . . , N} and pairs of nodes as edges in the setf C VxV. 
We denote the cardinality of £ as E, the set of neighbors of 
node i as A/i = £ £}, while L is the Laplacian of 

the graph Q, [12]. 

We consider the following convex constrained optimiza- 
tion problem 

minimize Y^iLi fi( x i) ( la ) 
subject to gij(xi,Xj)<0 for all (i, j) £ £ (lb) 
h(xi) < 0, for i = 1,...,N (lc) 

Z)»=l x i = ^tot (Id) 

where each variable Xi £ K™ is associated to the node i, each 
function /< : K" -> R, #y : R 2 ™ -> R, and /i 4 : R™ — > R is 
a continuously differentiable convex function. The constant 
vector xtot £ K™ dictates the total amount of each resource 
that is available in the system and therefore defines the 
resource allocation constraint 1 . We refer to problem (1) as 
the primal problem. 

Let x £ W lN be the stacked vector x = (xj , . . . , xJf) T , 
while f(x) is a separable cost function f(x) = Y?,i=i fi( x i)> 
and g(x) denotes in a compact stacked form all the separable 
constraint functions described in (lb)-(lc), with codomain 
(or target set) R"\ m = E + N. With this notation we can 
rewrite problem (1) in the compact form 

minimize f( x ) (2a) 

X 

subject to g(x) < (2b) 

(l^ <g> I n ) X = Xtot (2c) 

We assume that the constraints g(x) < define a closed 
and bounded convex set X. We assume also that the inter- 
section of X and the set defined by the equality constraint 
(ljf <8) I n ) x = Xtot is a closed, bounded, and non-empty 
convex set, denoted by X. Under these assumptions there 
exists a (possibly non-unique) optimizer of (1), which we 
indicate as x opt . We denote the primal optimal value by / opt . 

Problems of type (1) can be found in various domains 
including economics [13] and sensor networks [3]. Typically, 
in these fields the nodes have a coupling resource allocation 
constraint, for example the total monetary budget or the total 
available bandwidth, respectively. 

We are interested in solving the primal problem (1) via 
the use of iterative distributed algorithms. However, due to 

'For simplicity we consider here the case of Xtot £ K", although our 
analysis could be extended to handle cases in which Xtot £ K", with 
n < n. 



the facts that (i) the Lagrangian function associated with 
problem (1) is in general neither strictly convex in the primal 
variable x, nor strictly concave in the dual variables, and 
(ii) the resource allocation constraint couples all the nodes, 
standard primal-dual iterative methods have typically slow 
convergence rate and require high communication demand 
among the nodes. In order to address these issues, we study 
a regularized version of the saddle-point algorithm (primal- 
dual iterations) in the next section, which incorporates the 
resource allocation constraint directly in the update equa- 
tions. 

III. Regularized Saddle-Point Algorithm 

In this section, we present a distributed gradient-based 
optimization method that employs a fixed regularization in 
the primal and dual spaces. This regularization serves to ap- 
proximate the primal problem (1) in a way that can be solved 
by gradient-based methods with improved convergence prop- 
erties. Furthermore, we modify the primal iteration to ensure 
that each iterate satisfies the resource allocation constraint. 
This allows us to avoid the dualization of the equality 
constraint, which would need to be distributed among the 
nodes and lead to increased communication requirements. 

Let /i s R™ be the dual variable associated to the 
inequality constraint g(x) < 0, and v > 0, e > 
be strictly positive scalars. Motivated by [7], we define a 
regularized Lagrangian-type function associated to the primal 
problem (1) as 

C{x,n) :=f(x) + ^\\x\\ 2 +^g(x)~^e\\fi\\ 2 (3) 

This Lagrangian-type function is by definition a strictly 
convex function of the primal variable x and a strictly 
concave function of the dual variable /i. 

In order to leverage on strong duality relations, we use the 
following standard assumption. 

Assumption 1: There exists a Slater vector x £ X such 
that g[x) < 0. 

Our aim is to find an (approximate) solution of the primal 
problem (1), by solving the regularized saddle-point problem: 

maxmin C(x,fj.) (4) 

n x 

subject to (l^ ®I n )x = xtot 

whose optimal value is denoted by /*, and unique optimizer 
by x*. Under Assumption 1, the unique optimizer of the 
regularized problem (4) satisfies the KKT conditions: 

V ;c £(x*, A/ *) + (l^®/„) T p*=0 
V„£(a:*,p*)<0 

(iff ®I n ) X - Xtot = 

M*V Mg £(x*, M *)=0 forg=l,...,M 
H* q >0 for q = 1,...,M 



where fj,* and p* are the optimal Lagrangian multipliers 2 , 
while V x £ and V M £ indicate the gradients of the regularized 
Lagrangian-type function C(x, fi) with respect to x and [i. 

It is expected that in general the solutions of the primal 
problem (1) and the regularized saddle -point problem (4) are 
different, meaning \\x* - z opt || ^ and ||/* - f°v % \\ ^ 0. 
Furthermore, the solution of the regularized problem (4) 
does not necessarily satisfy the inequality constraints of the 
primal problem (1). However, it is possible to bound the 
suboptimality and the distance from the primal optimizer, 
along with the constraint violation by some function of 
the regularization parameters v and e. Thus while we are 
solving an approximation of the primal problem (1) we have 
bounds on the distance from the primal optimal solution. 
Furthermore, in this context the regularization procedure can 
be seen as a way to speed up the convergence of standard 
gradient-like methods, which may in fact lead to a closer 
iterate to the optimum / opt of the primal problem within 
a finite number of iterations even though an approximate 
regularized problem is being solved. For further details we 
refer the reader to the original works on regularization and 
double smoothing techniques [6], [7]. 

The regularized saddle-point problem (4) can be readily 
solved by centralized iterative methods. However, when 
a distributed solution is sought, the equality constraint is 
usually dualized and decomposed among the nodes, see 
for example the discussions in [14], [15]. Typically this 
procedure causes high communication load and the con- 
vergence rate would be affected by the number of nodes. 
In order to overcome these potential drawbacks we follow 
a different route and propose an extension to well-known 
iterative schemes that ensures the feasibility of each iterate 
with respect to the resource allocation constraint. The main 
idea can be thought of as "projecting" the iterates onto the 
feasible set of the equality constraint. This extension allows 
us to design an inherently distributed iterative scheme that 
still solves the original regularized problem (4). 

Let Vm + indicate the projection over the positive orthant, 
and let a > and /? > be fixed strictly positive 
scalars (step sizes). We consider the following saddle-point 
iterations: 



; (T+1) = -a/3(^(8)/n)V 2; £(x (r) ,/i (r) ) (5) 



(t+1) _ 



'+ Q V/( I W ,/ T) ) 

with any matrix W £ R NxN such that 
(a) the vectors Ijv and 1^ are left and right eigenvectors 

2 The Lagrangian multiplier p S 1" corresponds to the resource allocation 
equality constraint, which is used to describe the KKT conditions of 
the regularized problem, but not used in our proposed iterative solution 
approach. 



(6) 



of W associated to the zero eigenvalue, respectively: 

W1 N = 



1 T N W = 0, 



(b) the zero eigenvalue is unique, i.e., 

W + W T + (1/N)1 N 1 N >- 

(c) the matrix W has the same sparsity pattern as the 
Laplacian matrix L of the graph Q. 

It is easy to see that if the properties (a)-(c) hold, then 
the iterations (5)-(6) can be computed locally with only 
the information of the neighboring nodes. In this sense, the 
iterations (5)-(6) are inherently distributed. 

We claim that there exist conditions on the step-sizes 
a and (3 such that the iterations (5)-(6) converge to the 
unique optimal solution of the regularized problem (4). In 
particular, we expect that the step size a is linked to the 
characteristics of the functions / and g (as in standard 
gradient-like methods), while /3 is linked to W, i.e., the 
network topology. These relationships will be shown using 
the following lemma, which establishes three important 
properties of the iterations (5)-(6). 

Lemma 1: If the matrix W satisfies the property (a) and 
for the first iterate a;^ ^ the resource allocation constraint 
holds, i.e., (1^ ® I n )x^ = x to t, then 

(i) for any t, the iterate x^ satisfies the resource alloca- 



tion constraint, i.e., (1 N (8 I n )x 



(r) 



Xtot', 



(ii) the optimal couple of (4) is a fixed point of 

the iterations (5)-(6); 
(Hi) for any r, the equality a;( T+1 ) = x^ holds if and only 
if either V x C{x {t \ ^ (t) ) + (Ijv ® I n )p = 0, for some 
p e W\ or V K £(a: (T) ,M (T) ) = °- 
Proof. The first claim follows by induction based on [8]. 
Suppose that x^ T > satisfies the resource allocation constraint. 
Then for a;v r+1 ) 



[l N ®I n )x (T+1) 



(1 



TV ^ In 



) {x^ -ap{W®I n )V x L{x^,^)) 



and using the fact that (l^ <g) /„) (W <g> /„) = l^I^(g)7 n 
(property (a) of W) the claim follows. 

The second claim follows by direct calculations. Consider 
the optimal pair (x* , /!*) of (4), then using the KKT condi- 
tions we obtain 

a;( r+1 ) = x* - a(3(W <E> I n )W x C{x* , fx*) = 

x* + a(3 (W ® /„) (ljv ® I n ) P* ■ 

Since (W <g> /„) (In 8) I n ) = W1 N (g> I n = 0, it follows 
that a;( T+1 ) = a;* and therefore x* is a fixed point. 

The third claim follows from property (b) of W, i.e., the 
uniqueness of the zero eigenvalue. The equality x^ T+1 " > = 



x^ holds if and only if a/3(W 
tn 

if the vector \7 x C(x^ 



I n )V x C(x^,^) = 0. 

or 

/^ T )) is an eigenvector of W with 
associated zero eigenvalue. Therefore, using property (b) of 



This last equality is true either if V x C(x^ T \ ^ T ') 



W leads to V x C{x i - T \fi 



(ljv ® with p' e 



Choosing p' = —p proves the claim. □ 
Lemma 1 shows that the matrix W keeps the iterates 
feasible with respect to the resource allocation constraint and 
does not introduce undesired fixed points. 

The next section investigates the conditions on a and /3 
under which the primal-dual iterates x^ and /i^ 7 "' converge 
to the optimizer (x* 7 fi*) of (4), and the bounds on how far 
this solution is from the primal solution £ opt in terms of 
suboptimality ||/* — / opt || and constraint violation. 

IV. Convergence Properties 
Let z be the stacked vector z = (x T 



and define 

the mapping $(z) = (V x C(z) T , -V M £(z) T ) T . We use the 
short-hand notation W® = W ® I n . Moreover, let V be a 
generic projection operator, whose codomain will be clear by 
the context. The iterations (5)-(6) can be compactly written 
as: 

r PW 9 

hi 



,<v+i) 



$(z (t) ) 



T(z 



(7) 

The scope of this section is to identify the assumptions on 
£{x, /x) and the conditions on a and f3 that let the mapping 
T : R Nn xip R Nn x be a contraction mapping. This 
guarantees geometric convergence of the iterations (5)-(6) to 
the optimal point of (4). 

First of all we characterize the properties of the mapping 
$(z) under the following standard assumptions. 

Assumption 2: The iterates x^ and are contained in 
some closed, convex, and bounded sets for each iteration r. 
In other words, x (t) € X and yu (r) e M. 
We note that the assumption of G M is satisfied under 
Assumption 1 (see [7] for details). 

On the other hand, the assumption on x^ can be difficult 
to justify. Although if the sequence {x^} converges to x* 
we know that x^ is asymptotically bounded, there is no 
guarantee a priori that x^ is bounded in each iteration. 
Furthermore, we cannot simply project the x iterates in (5) 
on some closed, convex, and bounded set, since this would 
destroy the properties of the information exchange matrix W . 
In the Appendix we show a simple way to locally modify 
the function L(x,/i) in order to enforce that x^ E X, 
and therefore ensure the boundedness of the iterates x^ T \ 
Since this local modification does not change our converge 
analysis, we assume now that Assumption 2 is satisfied and 
we refer to the Appendix for the technical details. 

We make the following mild and technical assumptions: 



Assumption 3: The gradients of f(x) and each g q (x) are 
Lipschitz continuous with constants F and G q , respectively: 

\\V x f{a)-V x f{b)\\<F\\a-b\\, for a,bet 
||V a <7 e (a)-V a 5,(6)|| <G g \\a-b\\, for a,be±, 

q=l,...,m 

Assumption 4: The constraint gradient and Lagrangian 
dual variable are bounded as 

||V»5(x)|| < M d , and \\fi\\ < M„. 
We note that Assumptions 3 and 4 are commonly required 
in the analysis of gradient descent methods. Furthermore, 
Assumption 4 is generally satisfied under Assumption 2. 

Assumptions 1, 2, 3, and 4 are important to guarantee that 
the mapping $>(z) has certain regularity properties. In fact, 
under these assumptions, by Lemma 3.4 of [7], the mapping 
$(2:) is strongly monotone with constant Lp = minfV, e) and 
Lipschitz with constant F$. In other words, 

($(a) -$(&), a -b) >^||a-6|| 2 , fora,&6XxM (8) 

||$(o) - $(b)|| < F$\\a - b\\, foro,kXxM (9) 

Properties (8) and (9) will be important for convergence. 

A. Symmetric Case 

In this subsection we will assume that the matrix W is 
symmetric, i.e., W T — W . This will allow us to derive 
closed-form conditions for the step-sizes a and /3. Define 

C : =max(/3A max (M-0,l) (10) 
k := tp — F$ (/3X Ioax (W) — 1) (11) 

Theorem 1: Under the Assumptions 1, 2, 3, and 4 and for 
symmetric W, the conditions 



t (w) < i + JL i 



and 



a < 



2k 

cJf! 



(12) 



ensure geometrical convergence of the iterations (5)-(6) to 
the unique optimizer (x* , n*) of the regularized problem (4). 
Furthermore, the convergence rate r is 



r = 1 - 2aK + a z C*F£ (13) 
Proof. The distance of the primal iterate x^ T+1 ^ to a primal 
optimizer x* can be written as 



\ 2 = \\x^ 



**ll 2 - 



2a 



(PW* (v x £(xM />)) _ v,£(x*, M *)) ,sW - x*) + 



a 



(14) 



where we made use of the fact that x* is a fixed point of the 
iteration (5). Using the relation 



PW® (v x C(x iT \^) - \7 x £(x*,n* 



< 



/3 2 ALx(WO V x C(x^,^) - V x C{x\^) 



equation (14) becomes 

lla^+l)- 3*112 < \\ x (r)_ x *n2_ 



2a(/3W® (v k £(^ t \// t) ) -V x £(x*,//)) ,x {r) - x*) + 



(15) 



In a similar fashion, and using the non-expansive property of 
the projection, we can write the distance of the Lagrangian 
multiplier ^ r+1 ' to its optimal value /i* as: 

||m (t+1) -m*II 2 <IIm (t) -m*II 2 + 

2a (v„C(x^,fi^) - V^*, //), M M - M *} + 

a 2 jv^ix^,^) - V^C(x*,^)f (16) 
Summing up the relations (15)-(16) we obtain 

z *||2 < \\ z (r)_ z *\f_ 

pw 9 



2a 



hi 



l C 2 



(17) 



where C is defined as in (10). The term 

pw 9 



l ($(z< T >) -$(z*)) ,z (r) -z*^ 



can be expanded as 

0W 9 



hi 



hln 



,z^ ~z*J = 
($(z (T ') - $(z*)) + 



(a) 



- /at, 



Cb) 



We can bound the term (a) based on the strong monotonicity 
of in (8), while the term (b) can be bounded as 




/3W® - I Nn 



/3W® - I Nt 



PW® - I Nn 







($(z< T >)-^*)) ,z^-z*\ 



(t) _ ~*||2 



< (/3A max (W)-l)F $ ||^-z 

where we used the Lipschitz continuity of ^(z) in (9). The 
relation (17) then becomes 

||z( T+1) -z*|| 2 < (l-2a K + a 2 C 2 F 2 )\\z^-z*\\ 2 (18) 



with k defined as in (11). Therefore the first convergence 
condition is 1 — 2an + a 2 C 2 F£ < 1, while, since it 
is required that a > 0, the second condition must be 
K > 0. From these two conditions the relations (12) follow. 
Furthermore the convergence rate expression in (13) can be 
established based on (18). □ 
Corollary 1: The convergence conditions (12) on the step 
sizes can be upper-bounded by 



1 



and a < 2 



<P 



F 2 
1 <i> 



(19) 



A max (W0' 

Proof. The proof follows directly from ip/F$ > 0. 

The type of conditions in Corollary 1 are typical in 
(sub)gradient methods and are often referred to as "small 
enough" step size conditions [16]. We may notice that a is 
bounded by quantities related to the characteristics of the 
problem functions, while (3 is related to the structure of the 
information exchange graph. We also note that a has to be 
determined a priori based on the knowledge of the problem 
function properties, while /? can be computed in a distributed 
way by the nodes, since there are distributed algorithms to 
upper-bound \ ma x.(W), e.g. [17]. 

The following technical lemma characterizes the "quality" 
of the regularized optimal solution x* with respect to the 
original primal problem (1): it provides bounds on the 
amount of constraint violation of g(x*) and the suboptimality 
|/ opt — f*\ |. In order to compactly characterize these bounds, 
we define the constraint set of the regularized problem (4) 
as X„ iC , which implies that x* 6 X„ j£ . The set X^. e is 
closed, bounded, and convex, and in general different from 
the original primal constraint set X, being however X C Xj, e . 

Lemma 2: Under the Assumptions 1, 2, 3, and 4 the 
maximum constraint violation is bounded by 



max{0, <?,(£*)} < M^Mf, 



(20) 



where = max ||Va;<7j(x)|| for each i and 



, while the difference between the optimal value 
of the regularized problem (4) and the optimal value of the 
original one (1) can be bounded by 

II/* - /° pt H < MfM^J^+^D 2 (21) 

where Mj = niax ieX „,« l|Va;/(x)||, D = max ie x„ iE ||x|| 

Proof. The proof is a modified version of Lemma 3.3 
in [7]. In particular, the bound (20) follows directly from 
Lemma 3.3 in [7], while the bound (21) requires the modi- 
fication that we consider for Mf and D the maximum of x 
over X„ iC instead of X (which could lead to a too conservative 
result in our case). 

The proof of the bound (21) starts from bounding ||/* — 
/°pt 1 1 by 

ii/* - r pt i i < ii/* - /;=oi i + /;= - r pt (22) 



where /*_ is the optimal cost for the regularization problem 
with regularization parameter e = 0, and /*_ — / opt > 0. 
By convexity of /, we have 

r - /;=„ < v x /(x*) T (x* - < =0 ) (23) 

with x* =Q the unique optimizer of the regularization problem 
with regularization parameter e = 0. Since x*,x* =0 € X„ je 
and Xy >e is compact, by the continuity of the gradient 
||Va;/(x)||, the gradient norm is bounded and we can 
write (23) as 



||r-/ 6 *=oll < max \\V x f(x) 



^£ = 01 



(24) 



Mr 



However, By Proposition 3.1 of [7], we can bound \\x* — 
x*_ 1 1 by M M ^/e/2v and therefore (24) can be written as 



wr -f e=Q \\<M f M, 



(25) 



Using the estimate f* =0 — / opt < ^/2max xe x„ e \\ x \\ 2 > 
which follows directly from the definition of the cost function 
(see Lemma 7.1 of [7]), then (22) can be written as 

||/*-/ opt || < M f Ma\h^+^D 2 

B. Non-symmetric Case 

In this subsection, we consider the case of non-symmetric 
matrices W . It is not difficult to see that all the previous 
derivations hold true with the small modification that instead 
of A max (-) we will have cr max (-), meaning the largest singu- 
lar value. Unfortunately, due to the term (J max {j3W — In), 
the condition on (3 is not solvable in closed-form. Define 

C := max(/3a max (W0, 1), F<s,<T max (pW - I N ) 

then the conditions in (12) for the non-symmetric case 
become 

2k' 



-X/3W-In) < 



and 



a < 



C. Weight Design 

Instead of a unique step size j3, one may consider de- 
signing the whole information exchange weight matrix W. 
For simplicity we redefine the weight matrix as W := (3W 
whose pattern is fixed by the network structure (assumed 
here to be symmetric) but the single entries are variables 
to be determined. If we use W in the iterations (5)-(6), the 
convergence conditions on W (in addition to the one on a) 
can be written as 



\ 2 (W) > 

A max (TF) < l 



W + {l/N)l N l^yO 



These conditions are similar to those in [8]. In particular, 
the first condition is a connectivity condition, while the 
second could be interpreted as diagonal dominance. Using 
the fact that (W + {I/N^nI^^W = (I - (1/N)1 N 1 N ) 
and therefore W(W + (I/N^nI^-W = W, by Schur 
complement these relations can be translated into the LMI [8] 



W 



(1/N)1 N 1 N 

w 



w 
1 



>- 



(26) 



making the weight design a centralized convex problem 3 . 

V. A Robotic Network Application 

In this section we use an application scenario inspired by 
a realistic problem to illustrate the proposed method. We 
consider a group of mobile robots that can communicate 
among each other via a communication network. Let the 
graph that describes the network be Q = (V, £ ) and we 
will assume that it is time-invariant. Let Xi(k) € M 2 be the 
position of the robot i at the discrete time step k. Let y(k) € 
M 2 be the position of a moving target at the discrete time step 
k. We model the robot dynamics as single integrator systems 
and associate the convex cost function fi(xi(k) — Xi(k — 1)) 
with each of them that can represent the energy consumption. 
We assume the robots know the target location. 

We are interested in moving the robots to ensure that the 
target is always in the barycenter of their positions. Further- 
more, we require that robots connected by an edge in the 
fixed graph have a bound R on their maximal distance (for 
communication purposes). We limit the allowable change 
of position in one step ||x;(fc) — Xi(k — 1)|| by u max ,i to 
model physical limitations. Finally, our global objective is 
to meet the aforementioned requirements while minimizing 
the total energy consumption. At each discrete time k, the 
above problem can be written as 



minimize 

;i(k),...,x N {k) 



Eili fi{xi{k) -Xi{k-1)) 
subject to \\x l {k) - x. } {k)\\ 2 - R 2 < 

for all e £ 
\\xi(k) -Xi(k-1)\\- v maXti < 

for i = 1,...,N 
=!/(*) 

which is a specific instance of (1) for each time step k. Since 
the target is moving, y(k) corresponds to a time-varying total 
available resources xtot in the formulation of problem (l) 4 . 

3 For a more detailed analysis on the weight selection criteria the reader 
is referred to [8]. 

4 We note that, when the proposed saddle-point algorithm is used to solve 
the problem (27) at each discrete time step k, each initial Xi(fc)(°> can be 
chosen as Xi(fc)( ' = Xi(k — 1) + (y(k) — y(k — 1)), whereas Xi(0)' ' = 
2/(0). This ensures that the initial iterates satisfy the resource allocation 
constraint. 
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Fig. 1. Representation of the trajectories of the robots while the target moves (blue thick line). The initial graph and positions of the robots are marked 
in black, while the final configuration is marked in red. The notation indicates the j-th component of Xi. 



Our simulation example consists of N = 7 robots con- 
nected via a communication graph shown in Figure 1 with 
Laplacian matrix L. The parameters of the scenario are 
R = 1.2, and fi{5xi(k)) = (Qi8xi(k),5xi(k)), where 
Sxi(k) = Xi(k) — Xi(k — 1) and Qi = 1 for all i except 
for i = 6, for which Qq = 0. We consider i> ma x,6 = 0.5, 
while the others are set to +oo. Given the fact that the 
cost function is not strictly convex and the position of the 
robots are coupled via a resource allocation constraint, even 
this small-size problem could be difficult to solve (in terms 
of communication/computation requirements) for common 
gradient algorithms. This makes this example interesting to 
analyze with the proposed approach. 

We solve problem (27) via the regularized saddle-point 
algorithm with v = 10, e = .01, and W — L, a = 0.01, 
and j3 = 0.2. Figure 1 shows the computed trajectories 
of the robots while the target moves (blue thick line). The 
initial graph and positions of the robots are marked in black, 
while the final configuration is marked in red. In order to 
assess the "quality" of the regularized problem solution with 
respect to the original primal one, the maximal error of 
the optimal robot positions maxi ||a;*(fc) — x opt (/c)|| was 
computed and resulted in 0.02, which is acceptable in this 
application scenario. Finally, we report that the total number 
of communication/computation iterations per discrete time 
step k was r = 2000, and the computations required around 
0.03 s per node per discrete time step k, on an Intel Core 
i5 (2.3 GHz and 4GB DDR3) laptop. These results are 



encouraging since the regularization parameters were not 
specifically optimized to minimize the number of iterations. 
This aspect, along with extensive comparisons with common 
gradient methods are left as future development. 5 

VI. Conclusions 

We have presented a regularized saddle-point algorithm 
that solves convex optimization problems with resource 
allocation constraints in a distributed fashion. Convergence 
conditions for the step sizes of the method were derived and 
their relations to the properties of the graph and the opti- 
mization problem were shown. Finally, we have illustrated 
our proposed algorithm with a robotic network application 
scenario. A more in-depth study on the optimal design of the 
step-sizes and the regularization parameters, more extensive 
simulation studies on a variety of application scenarios, as 
well as time-variant communication networks are among the 
future research directions. 
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Appendix 

Bound on the primal iterates x 

Although we have not encountered any problems in the 
simulation study of Section V, in general it is difficult to 
guarantee theoretically that the primal iterations x^ stay 
bounded for all r. This is due to the fact that we cannot 
simply project the iterations (5) on X without destroying the 
properties of the information exchange matrix W. However, 
we show here that it is possible to find a simple way to 
enforce the boundedness of the iterates by the closed, convex, 
and bounded set X. We remark that this boundedness con- 
dition plays the important role of ensuring that the quantity 
based on x stays finite. Define X as the ball 

jv 

i = U {xi\ \\xi\\ <Xi) 
i=i 



Consider the simple modification of the regularized 
Lagrangian-type function C(x,fi), for some scalar t > 0, 
as 

N 

£{x, n, t) = C(x, ft) - (1/t) log(Xi - INI) (28) 

i=l 

where we use a logarithmic barrier to enforce the ball 
constraint. Moreover, consider the modified problem: 

maxmin C(x,/i,t) (29) 

fj, X 

subject to (l^ ®I n )x = xtot 

Assume that the following reasonable conditions hold: 
CI) The first iterate z (0) is in X; 

C2) The set X^ is a subset of X and the following 
inequalities hold: 

max I Id I < NXi <C t 

C3) the step-size a is sufficiently small to guarantee that 
log(A > i — \ \xi~ ||) stays finite for any r. 
Theorem 2: Under Conditions C1-C3 and using the mod- 
ified function C(x, ft,, t) instead of C(x, /i) in the iterates (5)- 
(6), the primal iterates x^ are contained in the closed and 
bounded convex set X. In addition, 

i) the optimizer of the modified problem (29) is approx- 
imately the same as the optimizer x* of the original 
problem (4); 

ii) the difference between the optimal value of the modi- 
fied problem (29), C* and the optimal value C* of the 
original problem (4) is approximately 

N 

C*-C* w-5^bg(Xi)/t (30) 

i=l 

which goes to zero when t — > oo. 
Proof. Conditions C1-C3 immediately imply that the primal 
iterates x^ T ' are bounded in the closed, convex, and bounded 
set X. Consider now the optimal point (x* , ft* ) of the original 
problem (4), for which it holds that 

N 

C(x*,fx*,t)=C{x*,n*)-{l/t)J2^g(X i -\\x*\\) 

i=l 

N 

« (C2) C(x*^*)-J2^g(X t )/t 
1=1 

from which the claims i) and ii) of the theorem follows. □ 



